[57] ABSTRACT

In accordance with the present invention, biologically active
protein fragments can be constructed which contain only
those specific portions of the serum albumin family of
proteins, such as the regions known as subdomains IIA and
IIIA, which are primarily responsible for the binding
properties of the serum albumins. The artificial serums that
can be prepared from these biologically active protein
fragments are advantageous in that they can be produced much
more easily than serums containing the whole albumin, yet
still retain all or most of the original binding potential of
the full albumin proteins. In addition, since the protein
fragment serums of the present invention can be made from
non-natural sources using conventional recombinant DNA
techniques, they are far safer than serums containing natural
albumin because they do not carry the potentially harmful
viruses and other contaminants that can be found in the
natural substances.

11 Claims, 4 Drawing Sheets 
DAHKSEVAHR FKDLGEENFK ALVLIAFAQY LQQCPFEDHV KLVNEVTEFA KTCVADESAE 

DTHKSEIAHR FKDLGEEHFK GLVLIAFSQY LQQCPFDEHV KLVNELTEFA KTCVADESHA 

DTHKSEIAHR FNDLGEKHFK GLVLVAFSQY LQQCPFEDHV KLVNEVTEFA KKCAADESAE 

DTHKSEIAHR FNDLGEENFQ GLVLIAFSQY LQQCPFDEHV KLVKELTEFA KTCVADESHA 

EAHKSEIAHR FKDLGEQHFK GLVLIAFSQY LQKCPYEEHI KLVQEVTDFA KTCVADENAE 

VDHHKHIADM YNLLTERTFK GLTLAIVSQN LQKCSLEELS KLVNEINDFA KSCTGNDKTP 

SQAQNQICTI FTEAKEDGFK SLILVGLAQN LPDSTLGDLV PLIAEALAMG VKCCSDTPPE 

I — h4 (I) 1 i-h5(I)-i | — h6(I) — | r 

61 | 71 81 | 91 | | 101 | 111 | 

NCDKSLHTLF GDKLCTVATL RETYGEMADC CAKQEPERNE CFLQHKDDNP NLPRLVRPEV 

GCEKSLHTLF GDELCKVASL RETYGDMADC CEKQEPERNE CFLSHKDDSP DLPKL*KPDP 

NCDKSLHTLF GDKLCTVATL RATYGELADC CEKQEPERNE CFLTHKDDHP NLPKL*KPEP 

GCDKSLHTLF GDELCKVATL RETYGDMADC CEKQEPERNE CFLNHKDDSP DLPKL*KPEP 

NCDKSIHTLF GDKLCAIPKL RDNYGELADC CAKQEPERNE CFLQHKDDNP NLPPFQRPEA 

ECEKPIGTLF YDKLCADPKV GVNYEWSKEC CSKQDPERAQ CFRAHRVFEH N*P**VRPKP 

DCERDVADLF QSAVCSSETL VEKN*DLKMC CEKTAAERTH CFVDHKAKIP RDLSLKAELPA 

h7 (I)-i | h8(I) 1 i h9(I) 1 I 

121 I | 131 141 151 161 I 171 I 

DVMCTAFHDN EETFLKKYLY EIARRHPYFY APELLFFAKR YKAAFTECCQ AADKAACLLP 

NTLCDEFKAD EKKFWGKYLY EIARRHPYFY APELLYYANK YNGVFQECCQ AEDKGACLLP 

DAQCAAFQED PDKFLGKYLY EVARRHPYFY GPELLFHAEE YKADFTECCP ADDKLACLIP 

DTLCAEFKAD EKKFWGKYLY EVARRHPYFY APELLYYANK YNGVFQECCQ AEDKGACLLP 

EAMCTSFQEN PTSFLGHYLH EVARRHPYFY APELLYYAEK YNEVLTQCCT ESDKAACLTP 

EETCALFKEH PDDLLSAFIH EEARNHPDLY PPAVLLLTQQ YGKLVEHCCE EEDKDKCFAE 

ADQCEDFKKD HKAFVGRFIF KFSKSNPMLP PHWLAIAKG YGEVLTTCCG EAEAQTCFDT 

h10(I) — h1 (II) 1 | h2 (II) 1 | h3 (II) — 

181 201 I 211 221 I 231 

KLDELRDEGK ASSAKQRLKC ASLQKFGERA FKAWAVARLS QRFPKAEFAE VSKLVTDLTK 

KIETMREKVL ASSARQRLRC ASIQKFGERA LKAWSVARLS QKFPKAEFVE VTKLVTDLTK 

KLDALKERIL LSSAKERLKC SSFQNFGERA VKAWSVARLS QKFPKADFAE VSKIVTDLTK 

KIDAMREKVL ASSARQRLRC ASIQKFGERA LKAWSVARLS QKFPKADFTD VTKIVTDLTK 

KLDAVKEKAL VAAVRQRMKC SSMQRFGERA FKAWAVARMS QRFPNAEFAE ITKLATDVTK 

KMKELMKHSH SIEDKQKHFC WIVNNYPERV IKALNLARVS HRYPKPDFKL AHKFTEETTH 

KKATFQHAVM KRVAELRSLC IVHKKYGDRV VKAKKLVQYS QKMPQASFQE MGGMVDKIVA 


FIG. 2-1 
FIG. 2 (continued): alignment of the same seven serum albumin
sequences from residue 241 onward, with helix annotations
h4(II) through h10(III).
BIOLOGICALLY ACTIVE PROTEIN 
FRAGMENTS CONTAINING SPECIFIC 
BINDING REGIONS OF SERUM ALBUMIN 
OR RELATED PROTEINS 

This application is a continuation of application Ser. No. 
08/024,547, filed Mar. 1, 1993, now abandoned. 

FIELD OF THE INVENTION 

The invention relates to the specific binding regions of 
serum albumin and related proteins and to biologically 
active protein fragments containing these specific binding 
regions that can be safely and economically produced using 
conventional recombinant DNA techniques. 

BACKGROUND OF THE INVENTION 

The serum albumins belong to a multigene family of
proteins that includes alpha-fetoprotein (AFP) and human
group-specific component (Gc), or vitamin D-binding protein.
The members of this multigene family are typically
relatively large multi-domain proteins, and the serum
albumins are major soluble protein constituents of the
circulatory system which have many physiological functions.
The albumins and their related proteins contribute
significantly to colloid osmotic blood pressure and aid in the
transport, distribution and metabolism of many endogenous
and exogenous ligands. These ligands represent a spectrum
of chemically diverse molecules, including fatty acids,
amino acids (notably tryptophan and cysteine), steroids,
metals such as calcium, copper and zinc, and numerous
pharmaceuticals. The albumins are thought to facilitate
transfer of many ligands across organ-circulatory interfaces
such as the liver, intestine, kidney and brain, and evidence
suggests the existence of an albumin cell surface receptor
(see Schnitzer et al., PNAS 85:6773 (1988)).

In addition, serum albumins are also found in tissues and
secreted fluids throughout the body. For example, it is
estimated that extravascular albumin comprises 60% of the
body's total albumin. Human serum albumin, or HSA, is a
protein of about 65,000 daltons in molecular weight and
contains 585 amino acids. Its amino acid sequence contains
a total of 17 disulphide bridges, one free thiol (Cys 34),
and a single tryptophan (Trp 214). The disulphides are
positioned in a repeating series of nine loop-link-loop
structures centered around eight sequential Cys-Cys pairs.

Studies of serum albumins have been made on a variety
of animal species, and it has been determined that
approximately 61% of the amino acid sequences are conserved
among the known sequences of bovine, rat and human serum
albumins. More recently, additional albumin sequences have
been determined for a wide-ranging group of vertebrates
including sheep, frog, salmon, mouse, pig and even sea
lampreys. Most of these proteins share high sequence
homology and all of them share the characteristic
repeating series of disulphide bridges. All members of the
albumin multigene family for which sequences have been
determined have internal sequence homology (from two- to
seven-fold), suggesting that the proteins evolved from a
common ancestral protein of possibly about 190 amino
acids. Other studies have confirmed this homology (see, e.g.,
Carter et al., Science 244:1195 (1989)).

Currently, there are literally thousands of applications for
serum albumin protein and its related proteins, Gc and AFP,
and most often these applications have used the native serum
albumin family of proteins obtained from bovine or human
sources. Unfortunately, at present, the numerous concerns
with regard to the safety of albumin-containing plasma
isolated from natural sources have greatly restricted the
availability of albumin proteins for many of these
applications. Included among these concerns is the heightened
possibility that the plasma from which the albumins are
obtained will be infected with various viral contaminants
including HIV or other AIDS-related viruses, Hepatitis-B,
herpes, and a number of other potentially pathogenic
microorganisms.

Because of these concerns, there have been many attempts
to prepare recombinant DNA sequences coding for serum
albumins which can be used in the artificial production of
this important molecule. Unfortunately, these attempts have
also been generally unsuccessful because, like most large
proteins, serum albumins denature quite readily and are
practically impossible to produce in usable quantities by
genetic engineering. It has thus remained a problem to
develop artificial serum solutions which are stable and which
can maintain the biological activity of natural serum
albumins.

Clearly, the utility of the serum albumin molecules is
based in large part on their ability to bind and thus transport
a wide variety of important macromolecules so as to regulate
a number of physiological functions in humans and animals.
However, although the binding properties of serum albumin
have been well established, the precise nature and location
of those binding regions have not. Thus, although certain
amino acid sites, such as Lys 199 and Tyr 411, have been
identified as involved in acetylation (see Hagag et al.,
Biochemistry 22:2420 (1983)) and esterification (see
Sollene et al., Molec. Pharmac. 14:754 (1979)), very little
has previously been known about the binding sites of the
serum albumins.

There has thus been a long-felt and unfulfilled need in the
art to identify specific binding sites in the serum albumin
family of proteins so as to allow the large-scale production
of protein fragments having the same binding properties and
biological activity as whole serum albumins. Since such
smaller genetically engineered polypeptides are much more
easily expressed and produced in large quantities than the
full albumins, the identification of these specific binding
sites would make commercial isolation and production of
artificial polypeptides having all of the same binding
properties of natural albumins much more economically and
technically feasible.

SUMMARY OF THE INVENTION 

In accordance with the present invention, it has now been
discovered that specific portions of the serum albumin
multigene family of proteins, specifically those portions
known as subdomains IIA and IIIA, are primarily responsible
for the binding properties of serum albumin and its
related proteins, and that biologically active artificial serums
prepared from protein fragments containing at least one of
these binding regions can be produced much more easily
than serums containing the whole protein. In particular, the
sequence for binding subdomain IIA appears to run from
about amino acid 190 through amino acid 300 on the albumin
molecules, and subdomain IIIA appears to be located on the
polypeptide from roughly amino acid 380 to about amino
acid 495.

Further, it also appears that a fusion product, which
includes not only the above binding subdomains IIA and
IIIA but also an additional region IIB, is useful in binding,
and this fusion product is coded on the polypeptide at about
amino acid 190 through 495. The discovery that the binding
of the albumin family of proteins is based primarily on these
specific binding regions will thus allow for the production of
protein fragments containing one or more of these binding
regions which are capable of exhibiting the same biological
activity as the whole albumin protein.
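
The approximate residue ranges given in this summary can be expressed as a small slicing helper. The following is only a minimal sketch and is not part of the original disclosure: the names `SUBDOMAIN_RANGES`, `extract_fragment` and `fragment_length` are hypothetical, the boundaries are the approximate values quoted above, and residues are assumed to be numbered from 1 with inclusive end points.

```python
# Approximate, 1-based inclusive residue ranges quoted in the summary.
# The dictionary and function names are illustrative assumptions only.
SUBDOMAIN_RANGES = {
    "IIA": (190, 300),
    "IIIA": (380, 495),
    "IIA-IIB-IIIA": (190, 495),  # the fusion product spanning all three regions
}


def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a one-letter sequence."""
    if start < 1 or end < start or end > len(sequence):
        raise ValueError(
            f"range {start}-{end} is outside a sequence of length {len(sequence)}"
        )
    # Convert the 1-based inclusive range to a Python slice.
    return sequence[start - 1:end]


def fragment_length(name: str) -> int:
    """Number of residues covered by a named range."""
    start, end = SUBDOMAIN_RANGES[name]
    return end - start + 1
```

Under these boundaries the IIA range covers 111 residues, the IIIA range 116, and the fusion range 306; the exact end points in any given albumin may differ slightly, as the word "about" in the text indicates.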

It is thus an object of the present invention to provide 
protein fragments containing at least one of the binding sites 
from the serum albumin family of proteins so as to allow the 
production of biologically active serum which does not 
contain albumin family proteins obtained from natural 
sources. 

It is further an object of the present invention to provide 
novel artificial polypeptides which can be constructed using 
conventional recombinant DNA techniques and which can 
be more safely, economically and effectively used in a 
variety of applications which call for serum albumins or 
other related proteins. 

It is even further an object of the present invention to 
construct biologically active protein fragments that are use- 
ful for a wide variety of physiological, chromatographic and 
crystallographic functions which can be produced in large 
quantities and which can effectively be used instead of 
whole serum albumins obtained from natural or artificial 
sources. 

These and other objects of the present invention are set
forth in, or will become obvious from, the description of the
preferred embodiments provided hereinbelow.

BRIEF DESCRIPTION OF THE DRAWING 
FIGURES: 

FIG. 1 is a stereo view illustrating the overall topology of 
human serum albumin. 

FIG. 2 is a representation of the sequence homology of the
amino acid sequences of a variety of the serum albumins
including, from top to bottom, human serum albumin (SEQ
ID NO:3), bovine serum albumin (SEQ ID NO:4), equine
serum albumin (SEQ ID NO:5), ovine serum albumin (SEQ
ID NO:6), rat serum albumin (SEQ ID NO:7), frog serum
albumin (SEQ ID NO:8), and salmon serum albumin (SEQ
ID NO:9).

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS: 

In accordance with the present invention, the characteristic
binding locations of the serum albumin family of
proteins were determined crystallographically at 3.1
Angstroms using a wild-type human serum albumin (HSA) and
at 2.8 Angstroms for a recombinant form of HSA expressed
in yeast (rHSA). A complete description of the structural
determination of a serum albumin protein through
crystallographic means is set forth in Nature, Vol. 358:209
(July 1992), incorporated herein by reference. These
crystallographic studies confirmed that the topology of serum
albumins such as human serum albumin is created by a
repeating series of six helical subdomains, known as IA, IB,
IIA, IIB, IIIA and IIIB. These six subdomains assemble to
form a heart-shaped molecule, as shown in the stereo view of
FIG. 1. However, the previous determinations of the serum
albumin structure gave little insight into its binding
locations, and it was previously thought that a number of the
helical subdomains were involved in albumin binding.

The detailed crystallography studies indicated that,
contrary to the prior albumin models, the principal binding
regions were located specifically in subdomain IIA and
subdomain IIIA. The binding cavity in region IIIA appears
to be the most active and accommodating on human serum
albumin, and many ligands, such as digitoxin, ibuprofen and
tryptophan, have been found to bind preferentially in this
region. Other ligand binding affinities have been
tested, and relative binding locations have now been
determined crystallographically for several ligands at low
resolution, as set forth below in Table 1. These tests showed
that aspirin and iodinated aspirin analogues show nearly
equal distributions between binding sites IIA and IIIA, while
warfarin appears to occupy a single site in IIA. Further, the
amino acid residues that have previously been thought to be
involved in the binding process, Trp 214, Lys 199 and
Tyr 411, are all located strategically in the IIA or IIIA
regions.


TABLE 1

Ligand binding locations to HSA

Ligand        D      N       Rf      Observed location

Aspirin       4.0    7362    0.11    IIA, IIIA
Warfarin      5.0    2555    0.167   IIA
Diazepam      6.8    2075    0.118   IIIA
Digitoxin     5.0    3751    0.137   IIIA
Clofibrate    6.0    2175    0.138   IIIA
Ibuprofen     6.0    2402    0.215   IIIA
AZT           4.0    7548    0.124   IIIA
IS            4.0    6334    0.19    IIA, IIIA
DIS           4.0    4734    0.20    IIA, IIIA
TIB           4.0    5431    0.12    IIA, IIIA

Ligand-HSA complexes and X-ray diffraction data were obtained in a manner 
as previously described in Table 1 . The observed locations refer to the primary 
binding sites. 

D, Resolution or d-spacing in Å. 

N, Number of paired unique reflections with F > 5σ. 

Rf> - FplCT P . 

AZT, 3'-Azido-3'-deoxythymidine. 

IS, 5-Iodosalicylic acid. 

DIS, 3,5-Diiodosalicylic acid. 

TIB, 2,3,5-Triiodobenzoic acid. 
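
The observed locations in Table 1 lend themselves to a simple lookup structure. The sketch below is an illustration only: the names `BINDING_SITES`, `ligands_at` and `ligands_only_at` are hypothetical, and the entries are transcribed from the table as read here, with the last ligand taken to be TIB (2,3,5-triiodobenzoic acid) as named in the footnotes.

```python
from __future__ import annotations

# Primary binding locations transcribed from Table 1 (illustrative names).
BINDING_SITES: dict[str, set[str]] = {
    "Aspirin": {"IIA", "IIIA"},
    "Warfarin": {"IIA"},
    "Diazepam": {"IIIA"},
    "Digitoxin": {"IIIA"},
    "Clofibrate": {"IIIA"},
    "Ibuprofen": {"IIIA"},
    "AZT": {"IIIA"},
    "IS": {"IIA", "IIIA"},
    "DIS": {"IIA", "IIIA"},
    "TIB": {"IIA", "IIIA"},
}


def ligands_at(site: str) -> list[str]:
    """Ligands observed at the given subdomain, alone or with another site."""
    return sorted(name for name, sites in BINDING_SITES.items() if site in sites)


def ligands_only_at(site: str) -> list[str]:
    """Ligands observed exclusively at the given subdomain."""
    return sorted(name for name, sites in BINDING_SITES.items() if sites == {site})
```

Querying `ligands_only_at("IIA")` singles out warfarin, which matches the statement in the text that warfarin occupies a single site in IIA.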

The structural determination of the binding regions of the
serum albumin family of proteins shows that the amino acid
sequences appear to be homologous across the various serum
albumins, as is evidenced in FIG. 2, wherein the amino
acid sequences of human, bovine, equine, ovine, rat, frog
and salmon albumins are compared. The crystallographic
studies conducted in order to locate and identify the albumin
protein binding sites appear to show that the IIA subdomain
is one of the key binding sites of the albumin protein, and
this region corresponds to an amino acid sequence beginning
at approximately amino acid number 190 of the albumin
protein and extending to about amino acid number 300. In
one specific embodiment, the sequence for the binding
region IIA as determined in bovine serum albumin is set
forth in SEQ ID NO:1, and this sequence runs from amino
acid number 190 through amino acid number 298 on bovine
serum albumin.
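
The homology visible in FIG. 2 can be quantified as a percent identity between two aligned rows. The following is a minimal sketch under stated assumptions: the function name `percent_identity` is hypothetical, the asterisk is assumed to mark an alignment gap in the figure, and the two example strings are the first 60 aligned positions of the human and bovine rows as they appear in FIG. 2-1.

```python
GAP = "*"  # assumption: FIG. 2 marks alignment gaps with an asterisk


def percent_identity(row_a: str, row_b: str) -> float:
    """Percent of aligned, non-gap columns at which two rows carry the same residue."""
    a = row_a.replace(" ", "")
    b = row_b.replace(" ", "")
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    identical = 0
    for x, y in zip(a, b):
        if x == GAP or y == GAP:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


# First 60 aligned positions of the human and bovine rows of FIG. 2-1.
HSA_1_60 = "DAHKSEVAHR FKDLGEENFK ALVLIAFAQY LQQCPFEDHV KLVNEVTEFA KTCVADESAE"
BSA_1_60 = "DTHKSEIAHR FKDLGEEHFK GLVLIAFSQY LQQCPFDEHV KLVNELTEFA KTCVADESHA"

if __name__ == "__main__":
    print(f"{percent_identity(HSA_1_60, BSA_1_60):.1f}% identical over positions 1-60")
```

This pairwise figure over a short stretch is not the same quantity as the roughly 61% conservation cited for bovine, rat and human albumins together; it only illustrates how such a comparison can be made.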

The crystallographic studies carried out by the inventor
also revealed that the IIIA subdomain was another key
binding site on the albumin family of proteins, and this
binding subdomain corresponds to a sequence of amino
acids which starts at about amino acid number 375 and
extends to about amino acid number 495. In another specific
embodiment, binding region IIIA has an amino acid
sequence as set forth in SEQ ID NO:2, and this sequence
appears to run from amino acid 378 through 494. In
accordance with the present invention, a protein fragment
containing at least one of the binding regions IIA or IIIA
discussed above can be prepared which will have the same
or similar biological activity as a whole natural serum
albumin.

In addition to the specific binding regions IIA and IIIA
discussed above, there also appears to be an additional
fusion product of subdomains IIA and IIIA that also acts to
give serum albumin some of its binding properties. This
fusion product appears to be a fragment that includes not
only binding regions IIA and IIIA, but subdomain IIB as
well. A protein fragment in accordance with the present
invention can thus also be constructed which contains the
region including IIA, IIB and IIIA, and this region
corresponds roughly to an amino acid sequence extending from
about amino acid 190 to about amino acid 495 on a serum
albumin family protein. Further, it is possible that such a
fragment would be even more biologically active and more
likely to preserve all of the original binding peculiarities
associated with the albumin family of proteins since there
are sometimes measurable allosteric effects between the
subdomains.

The isolation of any of the specific albumin family 
binding regions discussed above is advantageous in that not 
only can biologically active serums be produced from iso- 
lates of these binding fragments from the natural albumins, 
but recombinant methods can be used as well to construct 
protein fragments containing only these specific binding 
regions. In fact, the present invention is particularly advan- 
tageous because the protein fragments of the invention can 
be prepared artificially using conventional recombinant 
DNA techniques, and these fragments will be safer, more 
stable and more effective than the natural serums in a variety 
of applications, including column chromatography, 
biosensors, crystallographic or solution drug binding 
experimentation, and a wide range of medical and biochemi- 
cal procedures and experimentation. Thus, although isolates 
of the albumin proteins can be produced according to the 
present invention with one or more of the actual binding 
regions obtained from natural sources, it is preferred that 
conventional recombinant techniques be used to manufac- 
ture the protein fragments containing or corresponding to at 
least one of the binding regions discussed above, and these 
artificial fragments can be recovered and/or purified so as to
be useful in all applications where natural serum albumin
would be used.

In another aspect of the present invention, it has also been
discovered that key invariant residues are involved in
the ligand binding subdomains and are conserved in
most or all of the known albumins, and these key residues
would thus appear to be primarily responsible for the
binding properties attributed to these regions. Based on an
examination of the sequence homology as observed in FIG.
2, and based on other studies involving the crystallographic
patterns of the albumin proteins, it appears that there are
certain key residues that are conserved between all of the
determined albumin sequences and that fit precisely in the
binding regions IIA and IIIA discussed above. In particular,
these key invariant or conserved residues appear to be at
amino acid residues 257 and 260 of the IIA region, and at
amino acid residues 390, 391, 410, 411, 423, 437, 450, 453
and 485 of the IIIA region. It is thus contemplated that any
protein fragment that is constructed to contain at least the
key residues of either or both of the subdomains IIA and IIIA
as set forth above will also exhibit binding properties
equivalent or similar to those of the whole albumin molecules.
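
Invariant positions of this kind can be located by scanning the columns of an alignment for residues shared by every sequence. The sketch below is an illustration under stated assumptions, not the inventor's procedure: the function name `invariant_positions` is hypothetical, and the seven rows are the ten-residue segments for positions 251 to 260 as best they can be read from FIG. 2 (human, bovine, equine, ovine, rat, frog and salmon, top to bottom).

```python
# Alignment columns 251-260 as read from FIG. 2; the transcription and the
# assignment of the salmon row to this block are assumptions.
BLOCK_251_260 = [
    "LECADDRADL",  # human
    "LECADDRADL",  # bovine
    "LECADDRADL",  # equine
    "LECADDRADL",  # ovine
    "LECADDRAEL",  # rat
    "FECMTERLEL",  # frog
    "VTCMKERKTL",  # salmon
]


def invariant_positions(rows: list, first_position: int) -> list:
    """Return the residue numbers of columns where every row has the same residue."""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")
    positions = []
    for offset in range(width):
        column = {row[offset] for row in rows}
        if len(column) == 1:  # a single distinct residue means the column is invariant
            positions.append(first_position + offset)
    return positions


if __name__ == "__main__":
    print(invariant_positions(BLOCK_251_260, 251))
```

On this block the scan flags positions 257 and 260, the two IIA residues named in the text, and also the cysteine at 253. Sequence invariance alone therefore identifies candidates; the text's selection of key residues also relies on the crystallographic data.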

In summary, the present invention allows for the production
of protein fragments containing specific binding sites of
the albumin proteins which can be generated by conventional
recombinant DNA techniques and which have the
same or similar binding properties as the natural serum
albumins. It is thus contemplated that these protein
fragments can be prepared efficiently and economically in large
quantities so as to be substituted for the natural form of the
albumins in a variety of applications without any loss of
binding strength. As set forth herein, the term “protein
fragment” is well understood by those skilled in the art and
generally refers to those polypeptides comprising an amino
acid sequence that constitutes only a portion of a whole
protein molecule.

These protein fragments, when constructed artificially
using state-of-the-art recombinant means, will not only have
the same or similar biological activity as the natural whole
albumin proteins, but will also be safer than the natural form
of the albumins since they will not carry many of the
viral or other pathogenic contaminants that are found in the
natural products. As set forth herein, the term “biological
activity” is well understood by one skilled in the art and is
used generally to refer to the ability of a particular molecule,
such as a whole protein or a particularly active fragment
from a whole protein, to successfully carry out any of a
number of biological or biochemical functions.

When preparing fragments containing the specific binding
regions of the present invention, it will be well understood
by those skilled in the art that a number of alternate
sequences can be prepared which will differ in some slight
manner from the binding regions as discussed above, yet
which are considered within the scope of the invention. For
example, these alternate embodiments include those
fragments or sequences which have slight variations as to
specific amino acids, such as those which include an
addition or deletion of a particular amino acid, possibly at the
leading or trailing end of the fragment, which maintain the
binding properties of the albumin family of proteins in the
manner set forth above. Additionally, those sequences which
contain certain changes in specific amino acids which may
enhance or decrease the binding affinity of various
compounds, or reduce the likelihood of producing an
antigenic response, will also be within the scope of the invention
as would be obvious to one of ordinary skill in the art.
Finally, as set forth above, it is contemplated that because
the subdomain regions of the multigene family of albumin
proteins appear to be the same or similar, the biologically
active protein fragments of the present invention can be
constructed from specific binding regions of any of the
proteins of the serum albumin family, such as the Gc and
AFP proteins discussed above. All of these embodiments are
deemed to be covered within the scope of the present
invention which is set forth in the claims appended hereto.
SEQUENCE LISTING 

(1) GENERAL INFORMATION:

(iii) NUMBER OF SEQUENCES: 9

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 109 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:


Ala Ser Ser Ala Arg Gln Arg Leu Arg Cys    10
Ala Ser Ile Gln Lys Phe Gly Glu Arg Ala    20
Leu Lys Ala Trp Ser Val Ala Arg Leu Ser    30
Gln Lys Phe Pro Lys Ala Glu Phe Val Glu    40
Val Thr Lys Leu Val Thr Asp Leu Thr Lys    50
Val His Lys Glu Cys Cys His Gly Asp Leu    60
Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu    70
Ala Lys Tyr Ile Cys Asp Asn Gln Asp Thr    80
Ile Ser Ser Lys Leu Lys Glu Cys Cys Asp    90
Lys Pro Leu Leu Glu Lys Ser His Cys Ile   100
Ala Glu Val Glu Lys Asp Ala Ile Pro       109
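
For comparison with the one-letter rows of FIG. 2, a three-letter listing such as the one above can be converted to one-letter code. The following is a minimal sketch: the names `THREE_TO_ONE` and `three_to_one` are hypothetical, the mapping is the standard amino acid code, and any position numbers interleaved with the residues are simply skipped.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def three_to_one(listing: str) -> str:
    """Convert whitespace-separated three-letter residues to a one-letter string."""
    letters = []
    for token in listing.split():
        if token.isdigit():
            continue  # position numbers in the listing are not residues
        if token not in THREE_TO_ONE:
            raise ValueError(f"unrecognized residue code: {token!r}")
        letters.append(THREE_TO_ONE[token])
    return "".join(letters)
```

Applied to the first ten residues of SEQ ID NO:1, the conversion gives `ASSARQRLRC`, which can be compared against the bovine row of FIG. 2 starting at position 191.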


(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:


His Leu Val Asp Glu Pro Gln Asn Leu Ile    10
Lys Gln Asn Cys Asp Gln Phe Glu Lys Leu    20
Gly Glu Tyr Gly Phe Gln Asn Ala Leu Ile    30
Val Arg Tyr Thr Arg Lys Val Pro Gln Val    40
Ser Thr Pro Thr Leu Val Glu Val Ser Arg    50
Ser Leu Gly Lys Val Gly Thr Arg Cys Cys    60
Thr Lys Pro Glu Ser Glu Arg Met Pro Cys    70
Thr Glu Asp Tyr Leu Ser Leu Ile Leu Asn    80
Arg Leu Cys Val Leu His Glu Lys Thr Pro    90
Val Ser Glu Lys Val Thr Lys Cys Cys Thr   100
Glu Ser Leu Val Asn Arg Arg Pro Cys Phe   110
Ser Ala Leu Thr Pro Asp Glu               117

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 585 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: N-terminal


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:


Asp Ala His Lys Ser Glu Val Ala His Arg    10
Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys    20
Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr    30
Leu Gln Gln Cys Pro Phe Glu Asp His Val    40
Lys Leu Val Asn Glu Val Thr Glu Phe Ala    50
Lys Thr Cys Val Ala Asp Glu Ser Ala Glu    60
Asn Cys Asp Lys Ser Leu His Thr Leu Phe    70
Gly Asp Lys Leu Cys Thr Val Ala Thr Leu    80
Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys    90
Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu   100
Cys Phe Leu Gln His Lys Asp Asp Asn Pro   110
Asn Leu Pro Arg Leu Val Arg Pro Glu Val   120
Asp Val Met Cys Thr Ala Phe His Asp Asn   130
Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr   140
Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr   150
Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg   160
Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln   170
Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro   180
Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys   190
Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys   200
Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala   210
Phe Lys Ala Trp Ala Val Ala Arg Leu Ser   220
Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu   230
Val Ser Lys Leu Val Thr Asp Leu Thr Lys   240
Val His Thr Glu Cys Cys His Gly Asp Leu   250
Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu   260
Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser   270
Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu   280
Lys Pro Leu Leu Glu Lys Ser His Cys Ile   290
Ala Glu Val Glu Asn Asp Glu Met Pro Ala   300
Asp Leu Pro Ser Leu Ala Ala Asp Phe Val   310
Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala   320
Glu Ala Lys Asp Val Phe Leu Gly Met Phe   330
Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp   340
Tyr Ser Val Val Leu Leu Leu Arg Leu Ala   350
Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys   360
Cys Ala Ala His Asp Pro His Glu Cys Tyr   370
Ala Lys Val Phe Asp Glu Phe Lys Pro Leu   380
Val Glu Glu Pro Gln Asn Leu Ile Lys Gln   390
Asn Cys Glu Leu Phe Lys Gln Leu Gly Glu   400
Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg   410
Tyr Thr Lys Lys Val Pro Gln Val Ser Thr   420
Pro Thr Leu Val Glu Val Ser Arg Asn Leu   430
Gly Lys Val Gly Ser Lys Cys Cys Lys His   440
Pro Glu Ala Lys Arg Met Pro Cys Ala Glu   450
Asp Tyr Leu Ser Val Val Leu Asn Gln Leu   460
Cys Val Leu His Glu Lys Thr Pro Val Ser   470
Asp Arg Val Thr Lys                       475

C y s 

C y s 

T h r 

G 1 u 

Leu 

V a 1 

A s n 

A r g 

A r g 
4 8 5 

P X o 

C y s 

Phe 

S e x 

A I a 
4 9 0 

Leu 

G 1 u 

V a I 

Asp 

G 1 u 
4 9 5 

T y r 

V a 1 

P r o 

L y s 
5 0 0 

G 1 u 

Phe 

A s n 

A 1 a 

G 1 u 
5 0 5 

T h r 

Phe 

Th r 

Phe 

H i s 
5 l 0 

Ala 

I I e 

C y s 

T b r 
5 I 5 

Leu 

S e r 

G l u 

L y s 

G 1 u 
5 2 0 

A r g 

G 1 n 

I 1 e 

Ly s 

L y s 
5 2 5 

G I n 

Thr 

Leu 

V a 1 

5 3 0 

G I t> 

Leu 

V a 1 

L y s 

H i s 

5 3 5 

L y s 

P r o 

L y s 

A 1 a 

Th r 
5 4 0 

Ly s 

G 1 u 

G 1 n 

L y s 
5 4 5 

A 1 a 

V a 1 

Me t 

Asp 

Asp 
5 5 0 

Phe 

Ala 

A 1 a 

Phe 

V a 1 
5 5 5 

G l u 

Ly s 

C y s 

C y s 

Ala 

Asp 

Asp 

L y s 

G 1 u 
5 6 5 

T h r 

C y s 

Phe 

A 1 a 

G 1 u 
5 7 0 

G 1 u 

G l y 

Ly s 

Ly s 

Leu 
5 7 5 

Ala 

Ala 

S e r 

G 1 n 

A 1 a 

A I a 

Leu 

G 1 y 

Leu 








580 585 


( 2 ) INFORMATION FOR SEQ ID NO:4:

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 583 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( i i i ) HYPOTHETICAL: NO

( i v ) ANTI-SENSE: NO

( v ) FRAGMENT TYPE: N-terminal


( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4:


A s 

P 

T b r 

H i 

s 

L y s 

S c 

r 

G 1 

u 

I 1 e 

A 1 

a 

H i 

s 

A 

r 

g 

P b e 

Ly 

s 

A s 

P 

Leu 

G 1 

y 

1 






5 









1 

0 








1 5 


G 1 

u 

H i s 

P h 

c 

L y s 

G l 

y 

L e 

u 

V a 1 

L e 

u 

I 1 

e 

A 

l 

a 

Phe 

S e 

r 

G 1 

n 

Ty x 

L e 

u 






2 0 








2 5 










3 0 



G 1 

n 

Cy s 

P r 

o 

Phe 

A s 

p 

G 1 

u 

H 1 s 

Va 

1 

Ly 

s 

L 

e 

u 

V a 1 

A s 

n 

G 1 

u 

Leu 

T b 

r 


35 40 45 


A I a 

3 2 0 

A r g 
T b r 
G 1 u 
Pro 

G 1 n 

4 0 0 

P r o 
L y s 
C y s 
H i s 

5 e r 

4 8 0 

T h r 
Asp 
A 1 a 
Leu 

L y s 

5 6 0 

V a I 


G 1 u 
G 1 n 
G 1 u 
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P h c 

A 1 a 
5 0 

L y s 

T h r 

C y s 

V a 1 

A 1 a 

5 5 

Asp 

5 e r 

6 5 

Leu 

H i s 

T b r 

Leu 

Phe 

7 0 

G 1 y 

Asp 

A r g 

G 1 u 

T h r 

Ty r 

G 1 y 
8 5 

A s p 

Me t 

A 1 a 

G 1 u 

A i g 

A s (i 

G 1 u 
1 0 0 

Cy s 

P b e 

Leu 

S e r 

P r o 

L y s 

Leu 
1 1 5 

L y s 

P t o 

Asp 

Pro 

A s n 
1 2 0 

Asp 

G 1 u 
l 3 0 

L y s 

L y s 

P h e 

Tr p 

G 1 y 
1 3 5 

L y s 

H i s 
l 4 5 

P r o 

Ty i 

P h e 

Ty r 

A I a 
1 5 0 

Pro 

G 1 u 

A s n 

G 1 y 

V a 1 

P h e 

G 1 n 
1 6 5 

G 1 u 

Cy s 

C y s 

Leu 

Leu 

P r o 

Ly s 
1 8 0 

1 1 e 

G 1 u 

T b r 

Me t 

A 1 a 

A r g 

G l n 
1 9 5 

Aig 

Leu 

A T g 

C y s 

A 1 a 
2 0 0 

A 1 a 

Leu 
2 l 0 

L y s 

A 1 a 

Tr p 

S e r 

V a 1 
2 1 5 

A 1 a 

A 1 a 
2 2 5 

G 1 u 

P h e 

V a 1 

G 1 u 

V a 1 
2 3 0 

Thr 

Ly s 

H i s 

L y s 

G 1 u 

Cy s 

C y s 
2 4 5 

H i s 

G 1 y 

Asp 

A 1 a 

Asp 

Leu 

A 1 a 
2 6 0 

L y s 

Ty r 

1 1 e 

C y s 

L y s 

Leu 

L y s 
2 7 5 

G 1 u 

Cy s 

C y s 

A s p 

L y s 
2 8 0 

t 1 e 

Ala 
2 9 0 

G 1 u 

Va 1 

G 1 u 

L y s 

Asp 
2 9 5 

A 1 a 

T h r 

3 0 5 

A 1 a 

Asp 

P h e 

A 1 a 

G 1 u 
3 1 0 

Asp 

L y s 

A ] a 

L y s 

A s p 

A 1 a 

P b e 

3 2 5 

Leu 

G 1 y 

S e r 

H i s 

P r o 

G 1 u 

Ty r 
3 4 0 

A I a 

V a 1 

S e r 

V a 1 

G 1 u 

A 1 a 

T h r 
3 5 5 

Leu 

G 1 u 

G 1 u 

C y s 

Cy s 
3 6 0 

Ty r 

S e r 
3 7 0 

T h r 

V a 1 

Phe 

Asp 

Ly s 
3 7 5 

Leu 

A s n 
3 8 5 

Leu 

I 1 e 

L y s 

G 1 n 

A s n 
3 9 0 

C y s 

Asp 

G 1 y 

P b e 

G 1 n 

Asa 

A 1 a 
4 0 5 

Leu 

I 1 e 

V a 1 

Va 1 

S e r 

T b r 

P T o 
4 2 0 

T h r 

Leu 

Va 1 

G 1 u 

G 1 y 

T h r 

A r g 
4 3 5 

C y s 

C y s 

T h r 

L y s 

P r o 
4 4 0 

G 1 u 

Asp 
4 5 0 

Ty r 

Leu 

S e r 

Leu 

I 1 e 
4 5 5 

Leu 

L y s 

T h r 

P r o 

V a 1 

S e r 

G 1 u 

L y s 

V a I 


G 1 u 

S e i 

H i s 

A l a 
6 0 

G 1 y 

C y s 

G l L) 

L y s 

G 1 u 

Leu 

C y s 
7 5 

L y s 

V a I 

A l a 

S e r 

Leu 
8 0 

Asp 

Cy s 
9 0 

C y s 

G 1 u 

L y s 

G 1 n 

G 1 u 
9 5 

P r o 

Hi s 
1 0 5 

L y s 

Asp 

Asp 

S e r 

P r o 
1 1 0 

A s p 

Leu 

Th r 

Leu 

C y s 

Asp 

G 1 u 
1 2 5 

P b e 

L y s 

A 1 a 

Ty r 

Leu 

Ty r 

G 1 u 
1 4 0 

1 1 e 

A 1 a 

A r g 

A r g 

Leu 

Leu 

Ty r 
1 5 5 

Ty r 

A I a 

Asa 

L y s 

Ty r 
1 6 0 

G 1 n 

A 1 a 
1 7 0 

G 1 u 

Asp 

Ly s 

G 1 y 

Ala 
1 7 5 

C y s 

A r g 
1 8 5 

G 1 u 

L y s 

Va 1 

Leu 

A I a 
1 9 0 

S e r 

Ser 

Se r 

I 1 e 

G 1 a 

L y s 

Phe 

2 0 5 

G 1 y 

G 1 u 

A r g 

A T g 

Leu 

S e r 

G 1 n 

2 2 0 

L y s 

Phe 

P r o 

Ly s 

Leu 

V a 1 

T h r 
2 3 5 

Asp 

Leu 

T b r 

L y s 

V a 1 
2 4 0 

Leu 

Leu 
2 5 0 

G 1 u 

Cy s 

Ala 

Asp 

Asp 
2 5 5 

A r g 

Asp 
2 6 5 

A s n 

G 1 a 

A s p 

T b r 

1 1 e 

2 7 0 

Ser 

Ser 

P r o 

Leu 

Leu 

G 1 u 

L y s 
2 8 5 

S e r 

H i s 

C y s 

I 1 e 

P r o 

G 1 u 

Asa 
3 0 0 

Leu 

P r o 

P r o 

Leu 

Asp 

V a 1 

Cy s 
3 1 5 

Ly s 

Asa 

Ty r 

G 1 n 

G 1 u 
3 2 0 

Phe 

Leu 
3 3 0 

Ty r 

G 1 u 

Ty r 

S e r 

A r g 

3 3 5 

A T g 

Leu 
3 4 5 

Leu 

A r g 

Leu 

Ala 

L y s 
3 5 0 

G l u 

Ty r 

A 1 a 

L y s 

Asp 

Asp 

Pro 
3 6 5 

H i s 

A 1 a 

C y s 

Ly s 

H i s 

Leu 

V a 1 
3 8 0 

Asp 

G l u 

P t o 

G 1 a 

G 1 n 

P b e 

G 1 u 
3 9 5 

L y s 

Leu 

G i y 

G 1 u 

Ty r 
4 0 0 

A r g 

Ty i 
4 1 0 

T h i 

A r g 

Ly s 

V a l 

Pro 
4 1 5 

G 1 n 

Va 1 
4 2 5 

S e r 

A r g 

S e r 

Leu 

G 1 y 
4 3 0 

L y s 

V a 1 

G 1 u 

S e r 

G 1 u 

A r g 

Me t 
4 4 5 

P r o 

C y s 

T h r 

A s n 

A i g 

Leu 

C y s 
4 6 0 

V a 1 

Leu 

H i s 

G 1 u 

Th r 

Ly s 

C y s 

C y s 

T h r 

G 1 u 

S e r 

Leu 
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465 470 475 480 


V a 1 

As D 

Ar g 

A r g 

P r o 
4 8 5 

C y s 

P b e 

S e r 

A 1 a 

Leu 
4 9 0 

T h r 

P r o 

Asp 

G l u 

T b r 
4 9 5 

Ty r 

V a 1 

P r o 

Ly s 

A 1 a 
5 0 0 

P b e 

A s p 

G 1 u 

L y s 

Leu 
5 0 5 

P b e 

T h r 

P b e 

H i s 

A 1 a 
5 1 0 

Asp 

I 1 e 

C y s 

T b r 

Leu 
5 1 5 

P I o 

Asp 

T h r 

G 1 u 

L y s 
5 2 0 

G i n 

I 1 e 

L y s 

L y s 

G 1 n 

5 2 5 

T b r 

A 1 a 

Leu 

V a 1 

G 1 u 
5 3 0 

Leu 

Leu 

Ly s 

H i s 

L y s 
5 3 5 

P r o 

L y s 

A 1 a 

T h r 

G 1 u 
5 4 0 

G i u 

G 1 n 

Leu 

L y s 

T h r 
5 4 5 

V a 1 

Me t 

G 1 u 

A s n 

P b e 

5 5 0 

V a 1 

A 1 a 

Phe 

V a 1 

Asp 
5 5 5 

L y s 

C y s 

C y s 

Ala 

A 1 a 
5 6 0 

Asp 

Asp 

L y s 

G 1 u 

A 1 a 
5 6 5 

Cy s 

Phe 

A 1 a 

Va 1 

G 1 u 
5 7 0 

G 1 y 

P T o 

Ly s 

Leu 

V a 1 
5 7 5 

V a 1 

S e r 

T h r 

Gin 

T b r 

A 1 a 

Leu 

A 1 a 
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( 2 ) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 583 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( i i i ) HYPOTHETICAL: NO

( i v ) ANTI-SENSE: NO

( v ) FRAGMENT TYPE: N-terminal


( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:5:


Asp Thr His Lys Ser Glu Ile Ala His Arg Phe Asn Asp Leu Gly Glu
1               5                   10                  15

Lys His Phe Lys Gly Leu Val Leu Val Ala Phe Ser Gln Tyr Leu Gln
            20                  25                  30

Gln Cys Pro Phe Glu Asp His Val Lys Leu Val Asn Glu Val Thr Glu
        35                  40                  45

Phe Ala Lys Lys Cys Ala Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys
    50                  55                  60

Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu
65                  70                  75                  80

Arg Ala Thr Tyr Gly Glu Leu Ala Asp Cys Cys Glu Lys Gln Glu Pro
                85                  90                  95

Glu Arg Asn Glu Cys Phe Leu Thr His Lys Asp Asp His Pro Asn Leu
            100                 105                 110

Pro Lys Leu Lys Pro Glu Pro Asp Ala Gln Cys Ala Ala Phe Gln Glu
        115                 120                 125

Asp Pro Asp Lys Phe Leu Gly Lys Tyr Leu Tyr Glu Val Ala Arg Arg
    130                 135                 140

His Pro Tyr Phe Tyr Gly Pro Glu Leu Leu Phe His Ala Glu Glu Tyr
145                 150                 155                 160

Lys Ala Asp Phe Thr Glu Cys Cys Pro Ala Asp Asp Lys Leu Ala Cys
                165                 170                 175

Leu Ile Pro Lys Leu Asp Ala Leu Lys Glu Arg Ile Leu Leu Ser Ser
            180                 185                 190

Ala Lys Glu Arg Leu Lys Cys Ser Ser Phe Gln Asn Phe Gly Glu Arg
        195                 200                 205

Ala Val Lys Ala Trp Ser Val Ala Arg Leu Ser Gln Lys Phe Pro Lys
    210                 215                 220

Ala Asp Phe Ala Glu Val Ser Lys Ile Val Thr Asp Leu Thr Lys Val
225                 230                 235                 240

His Lys Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg
                245                 250                 255

Ala Asp Leu Ala Lys Tyr Ile Cys Glu His Gln Asp Ser Ile Ser Gly
            260                 265                 270

Lys Leu Lys Ala Cys Cys Asp Lys Pro Leu Leu Gln Lys Ser His Cys
        275                 280                 285

Ile Ala Glu Val Lys Glu Asp Asp Leu Pro Ser Asp Ile Pro Ala Leu
    290                 295                 300

Ala Ala Asp Phe Ala Glu Asp Lys Glu Ile Cys Lys His Tyr Lys Asp
305                 310                 315                 320

Ala Lys Asp Val Phe Leu Gly Thr Phe Leu Tyr Glu Tyr Ser Arg Arg
                325                 330                 335

His Pro Asp Tyr Ser Val Ser Leu Leu Leu Arg Ile Ala Lys Thr Tyr
            340                 345                 350

Glu Ala Thr Leu Glu Lys Cys Cys Ala Glu Ala Asp Pro Pro Ala Cys
        355                 360                 365

Tyr Arg Thr Val Phe Asp Gln Phe Thr Pro Leu Val Glu Glu Pro Lys
    370                 375                 380

Ser Leu Val Lys Lys Asn Cys Asp Leu Phe Glu Glu Val Gly Glu Tyr
385                 390                 395                 400

Asp Phe Gln Asn Ala Leu Ile Val Arg Tyr Thr Lys Lys Ala Pro Gln
                405                 410                 415

Val Ser Thr Pro Thr Leu Val Glu Ile Gly Arg Thr Leu Gly Lys Val
            420                 425                 430

Gly Ser Arg Cys Cys Lys Leu Pro Glu Ser Glu Arg Leu Pro Cys Ser
        435                 440                 445

Glu Asn His Leu Ala Leu Ala Leu Asn Arg Leu Cys Val Leu His Glu
    450                 455                 460

Lys Thr Pro Val Ser Glu Lys Ile Thr Lys Cys Cys Thr Asp Ser Leu
465                 470                 475                 480

Ala Glu Arg Arg Pro Cys Phe Ser Ala Leu Glu Leu Asp Glu Gly Tyr
                485                 490                 495

Pro Val Lys Glu Phe Lys Ala Glu Thr Phe Thr Phe His Ala Asp Ile
            500                 505                 510

Cys Thr Leu Pro Glu Asp Glu Lys Gln Ile Lys Lys Gln Ser Ala Leu
        515                 520                 525

Ala Glu Leu Val Lys His Lys Pro Lys Ala Thr Lys Glu Gln Leu Lys
    530                 535                 540

Thr Val Leu Gly Asn Phe Ser Ala Phe Val Ala Lys Cys Cys Gly Arg
545                 550                 555                 560

Glu Asp Lys Glu Ala Cys Phe Ala Glu Glu Gly Pro Lys Leu Val Ala
                565                 570                 575

Ser Ser Gln Leu Ala Leu Ala
            580
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( 2 ) INFORMATION FOR SEQ ID NO:6: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 583 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein
( i i i ) HYPOTHETICAL: NO
( i v ) ANTI-SENSE: NO

( v ) FRAGMENT TYPE: N-terminal


( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:6:


Asp 

I 

T b r 

H i s 

L y s 

S c r 
5 

G 1 u 

I l e 

A 1 a 

H i s 

A r g 
1 0 

P h e 

Asn 

Asp 

Leu 

G 1 y 
1 5 

G 1 u 

G 1 u 

A s o 

P h e 

G 1 a 
2 0 

G 1 y 

Leu 

Va 1 

Leu 

1 1 e 

2 5 

A 1 a 

P h e 

S e r 

G 1 n 

T y r 
3 0 

Leu 

G 1 n 

G 1 o 

C y s 

P r o 

3 5 

P b e 

Asp 

G 1 u 

H i s 

Va 1 
4 0 

L y s 

Leu 

V a 1 

L y s 

G 1 u 
4 5 

Leu 

T b T 

G 1 u 

P b c 

A 1 a 
5 0 

L y s 

T h i 

C y s 

V a 1 

Ala 
5 5 

Asp 

G 1 u 

S e r 

H i s 

A 1 a 
6 0 

G 1 y 

C y s 

Asp 

L y s 

5 c r 

6 5 

Leu 

H i s 

T h r 

Leu 

Pht 
7 0 

G 1 y 

Asp 

Glu 

Leu 

C y s 
7 5 

Ly s 

V a 1 

A 1 a 

T h r 

Leu 
8 0 

A r g 

G 1 u 

Tbr 

Ty r 

G I y 
8 5 

Asp 

Me t 

A 1 a 

Asp 

C y s 
9 0 

C y s 

G 1 u 

Ly s 

G 1 n 

G 1 u 
9 5 

Pro 

G 1 u 

Arg 

Asn 

G 1 u 
1 0 0 

C y s 

P b e 

Leu 

A s n 

Hi s 
1 0 5 

Ly s 

A s p 

A s p 

S e r 

Pro 
1 1 0 

Asp 

Leu 

Pro 

Ly s 

Leu 
1 1 5 

L y s 

P r o 

G 1 u 

P r o 

A s p 
1 2 0 

T b i 

Leu 

C y s 

A I a 

G 1 u 
1 2 5 

P h e 

L y s 

A I a 

A s p 

G l u 
1 3 0 

L y s 

L y s 

P h e 

Tr p 

G 1 y 
1 3 5 

L y s 

Ty r 

Leu 

Ty r 

G I u 
1 4 0 

V a 1 

A 1 a 

Arg 

Arg 

H i s 
1 4 5 

P r o 

Ty r 

P h e 

Ty r 

A 1 a 
1 5 0 

P r o 

G 1 u 

Leu 

Leu 

Ty r 
1 5 5 

Ty r 

Ala 

Asn 

Ly s 

Ty r 
1 6 0 

A S D 

G I y 

V a 1 

P h e 

G 1 n 
1 6 5 

G 1 u 

C y s 

Cy s 

G l n 

A 1 a 
1 7 0 

G 1 u 

Asp 

L y s 

G 1 y 

A 1 a 
l 7 5 

C y s 

Leu 

Leu 

P r o 

L y s 
1 8 0 

I 1 e 

Asp 

A 1 a 

Me t 

Arg 
1 8 5 

G 1 u 

L y s 

Va 1 

Leu 

A l a 
1 9 0 

S e r 

S e r 

Ala 

Arg 

G 1 n 
1 9 5 

Arg 

Leu 

A T g 

C y s 

Ala 
2 0 0 

S e r 

I I e 

G 1 u 

L y s 

P h e 

2 0 5 

G 1 y 

G 1 u 

Arg 

Ala 

Leu 
2 1 0 

L y s 

Ala 

T r p 

S e r 

V a I 
2 1 5 

A 1 a 

At g 

Leu 

S e r 

G 1 a 
2 2 0 

L y s 

P b e 

P T O 

L y s 

A 1 a 

2 2 5 

A s p 

P h e 

T b r 

Asp 

V a 1 

2 3 0 

T h r 

L y s 

[ 1 e 

Va 1 

Th r 
2 3 5 

Asp 

Leu 

Th r 

L y s 

V a I 
2 4 0 

H i s 

L y s 

G 1 u 

C y s 

C y s 
2 4 5 

h i s 

G 1 y 

Asp 

Leu 

Leu 
2 5 0 

G 1 u 

Cy s 

A 1 a 

Asp 

Asp 
2 5 5 

Arg 

A 1 a 

Asp 

Leu 

A I a 
2 6 0 

Ly s 

Ty r 

1 I e 

C y s 

Asp 
2 6 5 

H i s 

G 1 n 

Asp 

A I a 

Leu 
2 7 0 

S c r 

S e r 

L y s 

Leu 

L y s 
2 7 5 

G 1 u 

C y s 

C y s 

Asp 

L y s 
2 8 0 

P r o 

V a 1 

Leu 

G 1 u 

L y s 
2 8 5 

S e r 

H i 

s 

Cy s 

I I e 

A 3 a 
2 9 0 

G 1 u 

V a 1 

A s p 

Ly s 

Asp 
2 9 5 

A I a 

Va 1 

P r o 

G 1 u 

Asn 

3 0 0 

Leu 

P r o 

P r o 

Leu 

T b r 
3 0 5 

A 1 a 

Asp 

P h e 

A 1 a 

G 1 u 
3 1 0 

A s p 

Ly s 

G 1 u 

V a I 

C y s 
3 1 5 

L y s 

Asn 

Ty i 

G 1 

[ n 

G 1 u 
3 2 0 

A 1 a 

Ly s 

A s p 

Va 1 

P b e 

3 2 5 

Leu 

G 1 y 

S e r 

Ph e 

Leu 
3 3 0 

T y r 

G 1 u 

Ty r 

S e r 

Arg 

3 3 5 

Arg 

H i s 

P r o 

G 1 u 

Ty i 

3 4 0 

A I a 

V a 1 

Se r 

V a 1 

Leu 
3 4 5 

Leu 

A r g 

Leu 

A 1 a 

L y s 
3 5 0 

G 1 u 

Ty r 

G 1 u 

A 1 a 

T b i 
3 5 5 

Leu 

G 1 u 

Asp 

C y s 

C y s 
3 6 0 

A l a 

Ly s 

G 1 u 

Asp 

P r o 
3 6 5 

H i s 

A 1 a 

C y s 

Ty r 

Ala 

T b r 

Va 1 

P b e 

Asp 

Ly s 

Leu 

L y s 

H i s 

Leu 

Va 1 

Asp 

G 1 u 

Pro 

G 1 n 
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370 375 380 


A s n 
3 8 5 

Leu 

1 1 e 

L y s 

Ly s 

A s n 
3 9 0 

C y s 

G I u 

Leu 

P b e 

G 1 u 
3 9 5 

L y s 

H i s 

G 1 y 

G 1 u 

G l y 

P b e 

G I n 

A s n 

A I a 
4 0 5 

Leu 

I I e 

V a 1 

A r g 

Ty r 
4 1 0 

T h r 

Arg 

L y s 

A I a 

P r o 
4 1 5 

V a 1 

S e r 

T b r 

P r o 
4 2 0 

T h r 

Leu 

V a 1 

G 1 u 

I l e 
4 2 5 

S e r 

Arg 

S e r 

Leu 

G 1 y 
4 3 0 

L y s 

G 1 y 

T b r 

L y s 
4 3 5 

C y s 

C y s 

A 1 a 

L y s 

P r o 
4 4 0 

G I u 

S e r 

G 1 u 

Arg 

Me t 
4 4 5 

P r o 

C y s 

G 1 u 

Asp 
4 5 0 

Ty r 

Leu 

S e r 

Leu 

I 1 e 
4 5 5 

Leu 

A s a 

A r g 

Leu 

C y s 
4 6 0 

V a I 

Leu 

H i s 

L y s 
4 6 5 

T h r 

Pro 

V a 1 

S e r 

G 1 u 
4 7 0 

L y s 

Va 1 

T h r 

L y s 

C y s 
4 7 5 

C y s 

T b r 

G 1 u 

Sei 

Va 1 

A s n 

Arg 

A r g 

P r o 
4 8 5 

C y s 

Pie 

S e r 

Asp 

Leu 
4 9 0 

T h r 

Leu 

Asp 

G 1 u 

T b r 
4 9 5 

Va 1 

P r o 

L y s 

P r o 
5 0 0 

Pie 

Asp 

G I u 

L y s 

Pb e 

5 0 5 

P b c 

T b r 

P h e 

H i s 

A 1 a 
5 I 0 

Asp 

C y s 

T b i 

Leu 
5 1 5 

Pro 

Asp 

T b r 

G I u 

L y s 
5 2 0 

G 1 n 

I 1 c 

L y s 

L y s 

G L n 
5 2 5 

T b r 

A 1 a 

Va 1 

G 1 u 
5 3 0 

Leu 

Leu 

L y s 

H i s 

L y s 
5 3 5 

P r o 

L y s 

A 1 a 

T h r 

Asp 
5 4 0 

G 1 u 

G 1 a 

Leu 

T h r 
5 4 5 

V a 1 

M e t 

G 1 u 

Asa 

Pie 
5 5 0 

V a I 

A 1 a 

P h e 

V a 1 

Asp 
5 5 5 

L y s 

C y s 

Cy s 

A 1 a 

Asp 

Asp 

L y s 

G 1 u 

G I y 
5 6 5 

C y s 

P b e 

V a I 

Leu 

G 1 u 
5 7 0 

G 1 y 

P r o 

L y s 

Leu 

V a 1 
5 7 5 

S e r 

Thr 

G 1 n 

Ala 

Ala 

Leu 

A I a 










5 8 0 


( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 584 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( i i i ) HYPOTHETICAL: NO

( i v ) ANTI-SENSE: NO

( v ) FRAGMENT TYPE: N-terminal

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7:


G 1 u 
1 

A I a 

H i s 

L y s 

S e r 
5 

G 1 u 

I 1 c 

A 1 a 

H i s 

Arg 
1 0 

P b e 

Ly s 

A s p 

Leu 

G 1 y 
1 5 

G 1 n 

H i s 

Phe 

L y s 
2 0 

G I y 

Leu 

V a 1 

Leu 

1 1 e 

2 5 

A 1 a 

P b e 

S e r 

G l a 

Ty r 
3 0 

Leu 

L y s 

C y s 

Pro 
3 5 

Ty r 

G l u 

G 1 u 

H i s 

I 1 e 
4 0 

Ly s 

Leu 

V a I 

G 1 n 

G 1 u 
4 5 

V a 1 

T h r 

Phe 

A 1 a 
5 0 

L y s 

T b r 

C y s 

V a 1 

A I a 
5 5 

Asp 

G I u 

A s n 

A I a 

G 1 u 
6 0 

Asa 

C y s 

Asp 

5 e r 

6 5 

I I e 

H i s 

Thr 

Leu 

P b e 
7 0 

G 1 y 

Asp 

L y s 

Leu 

Cy s 
7 5 

Ala 

1 1 e 

P r o 

Ly s 

Arg 

Asp 

A s n 

Ty r 

G 1 y 
8 5 

G 1 u 

Leu 

Ala 

Asp 

C y s 
9 0 

C y s 

A ! a 

L y s 

G 1 n 

G 1 u 
9 5 

G 1 u 

Arg 

A s n 

G 1 u 

C y s 

Phe 

Leu 

G 1 n 

H i s 

L y s 

A 9 p 

Asp 

Asa 

Pro 

A s n 


1 0 0 1 0 5 11 0 


Ty r 
4 0 0 

G 1 n 
V a 1 
Thr 
G 1 u 

Leu 

4 8 0 

Ty r 
I 1 e 
Leu 
L y s 

A l a 

5 6 0 

A I a 


G 1 u 

G 1 n 

Asp 

L y s 

Leu 
8 0 

P r o 
Leu 
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Pro 

P r o 

P h e 
1 1 5 

G l n 

A r g 

P r o 

G 1 u 

A 1 a 
1 2 0 

G 1 u 

Ala 

Me t 

Cy s 

T h r 
1 2 5 

S e r 

Phe 

G l n 

G 1 u 

Asn 

1 3 0 

P r o 

T b r 

S e r 

P b e 

Leu 
1 3 5 

G 1 y 

H i s 

Ty r 

Leu 

H i s 
1 4 0 

G 1 u 

V a 1 

Ala 

A r g 

A r g 
1 4 5 

H i s 

P r o 

Ty r 

P h e 

Ty r 
1 5 0 

Ala 

P r o 

G 1 u 

Leu 

Leu 
1 5 5 

Ty r 

Ty I 

A 1 a 

G 1 u 

L y s 
1 6 0 

T y r 

Asn 

G 1 u 

V a 1 

Leu 
1 6 5 

T h r 

G 1 n 

Cy s 

Cy s 

Tb r 
1 7 0 

G 1 u 

S e r 

Asp 

Ly s 

A 1 a 
1 7 5 

A 1 a 

C y s 

Leu 

Th i 

P r o 
1 8 0 

L y s 

Leu 

Asp 

A 1 a 

Va 1 
1 8 5 

Ly s 

G 1 u 

Ly s 

A 1 a 

Leu 
1 9 0 

V a 1 

A 1 a 

Ala 

V a 1 

A r g 
1 9 5 

G l n 

A r g 

Me t 

L y s 

C y s 
2 0 0 

S e r 

Set 

Met 

G 1 n 

A r g 

2 0 5 

Phe 

G 1 y 

G 1 u 

A i g 

A 1 a 
2 1 0 

P h e 

L y s 

Ala 

Tr p 

A 1 a 

2 1 5 

V a 1 

A 1 a 

A r g 

Me t 

S e r 
2 2 0 

G 1 n 

A r g 

Phe 

Pro 

Asn 
2 2 5 

A 1 a 

G l u 

P b e 

A 1 a 

G 1 u 
2 3 0 

1 1 e 

T b i 

L y s 

Leu 

A 1 a 
2 3 5 

T b r 

Asp 

V a 1 

T h r 

L y s 
2 4 0 

1 1 c 

Asn 

L y s 

G 1 u 

Cy s 
2 4 5 

C y s 

H i s 

G 1 y 

Asp 

Leu 
2 5 0 

Leu 

G 1 u 

C y s 

Ala 

Asp 
2 5 5 

Asp 

A r g 

A 1 a 

G 1 u 

Leu 
2 6 0 

A l a 

L y s 

Ty r 

Me t 

Cy s 
2 6 5 

G l u 

Asn 

G 1 n 

A 1 a 

T h r 
2 7 0 

1 1 e 

S e r 

S c r 

L y s 

Leu 
2 7 5 

G 1 n 

A 1 a 

C y s 

C y s 

Asp 
2 8 0 

Ly s 

Pro 

V a l 

Leu 

G 1 n 
2 8 5 

Ly s 

S e r 

G 1 n 

C y s 

Leu 
2 9 0 

A 1 a 

G 1 u 

T h r 

G 1 u 

H i s 
2 9 5 

Asp 

A s n 

I 1 e 

P r o 

A 1 a 
3 0 0 

Asp 

Leu 

Pro 

S e r 

I 1 e 

3 0 5 

A 1 a 

A 1 a 

Asp 

P h e 

V a 1 
3 1 0 

G 1 u 

Asp 

L y s 

G 1 u 

V a 1 
3 1 5 

C y s 

Ly s 

Asn 

Ty r 

Ala 
3 2 0 

G 1 u 

Ala 

Ly s 

Asp 

Va 1 

3 2 5 

P h e 

Leu 

G 1 y 

Th r 

P b e 

3 3 0 

Leu 

Ty r 

G 1 u 

Ty r 

S e r 
3 3 5 

A r g 

A r g 

H i s 

P r o 

Asp 
3 4 0 

Ty r 

S e r 

V a 1 

S e r 

Leu 

3 4 5 

Leu 

Leu 

A r g 

Leu 

A 1 a 
3 5 0 

L y s 

L y s 

Ty x 

G 1 u 

Ala 
3 5 5 

T h r 

Leu 

G 1 u 

L y s 

C y s 
3 6 0 

C y s 

A 1 a 

G 1 u 

G l y 

Asp 
3 6 5 

P T o 

P r o 

A 1 a 

C y s 

Ty r 
3 7 0 

G l y 

T b r 

V a 1 

Leu 

A 1 a 
3 7 5 

G 1 u 

Phe 

G 1 n 

P r o 

Leu 
3 8 0 

V a 1 

G 1 u 

G 1 u 

Pro 

L y s 
3 8 5 

Asn 

Leu 

Va 1 

Ly s 

Th r 
3 9 0 

Asn 

Cy s 

G 1 u 

Leu 

Ty r 
3 9 5 

G 1 u 

Ly s 

Leu 

G 1 y 

G 1 u 
4 0 0 

Ty r 

G 1 y 

P b e 

G 1 n 

Asn 
4 0 5 

A 1 a 

V a 1 

Leu 

Va 1 

A r g 
4 t 0 

Ty r 

T h r 

G 1 n 

Ly s 

A 1 a 
4 I 5 

P r o 

G 1 n 

Va 1 

S e r 

T b r 
4 2 0 

P r o 

T h i 

Leu 

V a 1 

G 1 u 
4 2 5 

Ala 

A 1 a 

A r g 

Asn 

Leu 
4 3 0 

G 1 y 

A r g 

V a 1 

G 1 y 

T h r 
4 3 5 

L y s 

C y s 

Cy s 

T h r 

Leu 
4 4 0 

P r o 

G l u 

A 1 a 

G 1 n 

A r g 
4 4 5 

Leu 

Pro 

Cy s 

V a 1 

G 1 u 
4 5 0 

Asp 

Ty r 

Leu 

S e r 

Ala 
4 5 5 

I i e 

Leu 

Asn 

A r g 

Leu 
4 6 0 

Cy s 

V a 1 

Leu 

H i s 

G 1 u 
4 6 5 

L y s 

T b r 

P r o 

Va 1 

S e r 
4 7 0 

G 1 u 

L y s 

V a 1 

T b r 

Ly s 
4 7 5 

Cy s 

C y s 

S e r 

G 1 y 

S e r 
4 8 0 

Leu 

V a 1 

G 1 u 

A r g 

At g 
4 8 5 

P I o 

C y s 

P h e 

S e r 

A 1 a 
4 9 0 

Leu 

T h r 

V a 1 

Asp 

G 1 u 
4 9 5 

Thr 

Ty r 

V a 1 

P r o 

L y s 
5 0 0 

G 1 u 

P b e 

L y s 

Ala 

G 1 u 
5 0 5 

Th r 

Phe 

T h r 

Phe 

H i s 
5 1 0 

S e r 

A s p 

I 1 e 

Cy s 

T h r 
5 1 5 

Leu 

P r o 

Asp 

Ly s 

G 1 u 
5 2 0 

Ly s 

G 1 n 

I 1 e 

Ly s 

L y s 
5 2 5 

G 1 n 

T h r 

A 1 a 

Leu 

A 1 a 

5 3 0 

G t u 

Leu 

V a 1 

L y s 

H i s 
5 3 5 

L y s 

P r o 

L y s 

Ala 

T h r 
5 4 0 

G 1 u 

A s p 

G 1 n 

Leu 




5,780,594



L y s 
5 4 5 

T h r 

V a 1 

Me t 

G 1 y 

Asp 
5 5 0 

Pie 

A 1 a 

G 1 n 

Pie 

Va 1 
5 5 5 

Asp 

L y s 

Cy s 

Cy 

s 

L y s 
5 6 0 

A l a 

A 1 a 

Asp 

L y s 

A s p 
5 6 5 

A s n 

Cy s 

P h e 

A 1 a 

t h i 

5 7 0 

G 1 u 

G 1 y 

P r o 

A s n 

L e 
5 7 

u 

5 

V a 1 

Ala 

A r g 

S e r 

L y s 
5 8 0 

G I u 

A 1 a 

Leu 

A 1 a 











( 2 ) INFORMATION FOR SEQ ID NO:8:

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 579 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( i i i ) HYPOTHETICAL: NO

( i v ) ANTI-SENSE: NO


( v ) FRAGMENT TYPE: N-terminal
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8:


V a I 

Asp 

H i s 

H i s 

Ly s 
5 

H i s 

1 1 e 

A 1 a 

Asp 

Me t 
1 0 

T y r 

Asn 

Leu 

Leu 

T b r 
1 5 

G 1 u 

A T g 

T h r 

Pie 

Ly s 
2 0 

G 1 y 

Leu 

T h x 

Leu 

A 1 a 
2 5 

I 1 e 

Va 1 

S c i 

G 1 n 

Asn 

3 0 

Leu 

G 1 n 

L y s 

C y s 

S e r 

3 5 

Leu 

G 1 u 

G 1 u 

Leu 

Sex 
4 0 

L y s 

Leu 

V a 1 

Asn 

G 1 u 
4 5 

I 1 e 

Asn 

Asp 

P h e 

A 1 a 
5 0 

Ly s 

Sex 

Cy s 

T b r 

G 1 y 
5 5 

A s n 

Asp 

L y s 

T h x 

P r o 
6 0 

G 1 u 

C y s 

G 1 u 

L y s 

Pro 
6 5 

I 1 e 

G l y 

T h r 

Leu 

Pie 

7 0 

Ty x 

Asp 

Ly s 

Leu 

C y s 
7 5 

Ala 

Asp 

P r o 

L y s 

V a 1 
8 0 

G 1 y 

V a 1 

A s n 

T y i 

G 1 u 
8 5 

T r p 

S e i 

L y s 

G 1 u 

Cy s 
9 0 

C y s 

S e r 

Ly s 

Gin 

Asp 
9 5 

P r o 

G I u 

A r g 

A 1 a 

G 1 n 
1 0 0 

Cy s 

Ph e 

A i g 

Ala 

H i s 
1 0 5 

A x g 

V a 1 

Pbe 

G 1 u 

H k s 
1 1 0 

Asn 

P r o 

Va 1 

A r g 

P x o 
1 1 5 

L y s 

P I o 

G 1 u 

G 1 u 

T h r 
1 2 0 

C y s 

Ala 

Leu 

P h c 

L y s 
1 2 5 

G 1 u 

H i s 

Pro 

Asp 

Asp 
1 3 0 

Leu 

Leu 

S e r 

Ala 

P b e 

1 3 5 

I 1 e 

H i s 

G 1 u 

G 1 u 

A 1 a 
1 4 0 

A r g 

Asn 

H i s 

P r o 

Asp 
1 4 5 

Leu 

Ty r 

P x o 

Pro 

A 1 a 
1 5 0 

Va 1 

Leu 

Leu 

Leu 

T h r 
1 5 5 

G 1 n 

G 1 n 

Ty r 

G 1 y 

L y s 
1 6 0 

Leu 

V a t 

G 1 u 

H i s 

C y s 
1 6 5 

Cy s 

G 1 u 

G I u 

G 1 u 

Asp 
1 7 0 

L y s 

Asp 

L y s 

C y s 

Pie 
1 7 5 

A 1 a 

G 1 u 

L y s 

Me t 

L y s 
1 8 0 

G 1 u 

Leu 

Me t 

L y s 

H i s 
1 8 5 

S e r 

H i s 

S e r 

I 1 e 

G 1 u 
1 9 0 

Asp 

L y s 

G 1 n 

L y s 

H i s 
1 9 5 

Pie 

C y s 

T x p 

I 1 e 

V a 1 
2 0 0 

As ft 

A s n 

T y r 

P r o 

G 1 a 
2 0 5 

A r g 

V a I 

I 1 e 

L y s 

Ala 

2 I 0 

Leu 

A s n 

Leu 

A I a 

A r g 
2 1 5 

V a 1 

Sex 

H i s 

A r g 

Ty r 
2 2 0 

P r o 

L y s 

Pro 

Asp 

Pie 

2 2 5 

L y s 

Leu 

A 1 a 

H i s 

L y s 
2 3 0 

Pie 

T h r 

G I u 

G 1 u 

T b r 
2 3 5 

T b x 

H i s 

Pbe 

1 l e 

L y s 
2 4 0 

Asp 

C y s 

C y s 

H i s 

G 1 y 
2 4 5 

Asp 

Me t 

P b e 

G 1 u 

C y s 
2 5 0 

Me t 

T h i 

G 1 u 

A r g 

Leu 
2 5 5 

G I u 

Leo 

Sex 

G 1 u 

H i s 
2 6 0 

Tt x 

C y s 

G I n 

H i s 

L y s 
2 6 5 

Asp 

G 1 u 

Leu 

S e r 

T h i 
2 7 0 

L y s 

Leu 

G 1 u 

L y s 

Cy s 

C y s 

A s n 

Leu 

P x o 

Leu 

Leu 

G 1 u 

A r g 

T h r 

Ty r 

C y s 

I 1 e 

Va 1 



275 280 285 


T h t 

Leu 
2 9 0 

G 1 u 

A s n 

Asp 

Asp 

V a 1 
2 9 5 

P r o 

A I a 

G 1 u 

Leu 

S e r 
3 0 0 

L y s 

P I o 

I l e 

T h r 

G 1 u 
3 0 5 

Pie 

T h r 

G 1 u 

Asp 

P r o 
3 1 0 

H i s 

V a 1 

C y s 

G 1 b 

L y s 
3 1 5 

T y r 

A 1 a 

G i u 

A s n 

Ly s 
3 2 0 

S e r 

Pie 

Leu 

G I u 

I t e 

3 2 5 

S e r 

P r o 

Tr p 

G 1 n 

S e r 
3 3 0 

G 1 n 

G I u 

T b r 

P r o 

G t u 
3 3 5 

L c u 

S e r 

G l u 

G 1 n 

P b e 
3 4 0 

Leu 

Leu 

G I n 

S e i 

A 1 a 
3 4 5 

L y s 

G 1 u 

Ty r 

G 1 u 

S e r 
3 5 0 

Leu 

Leu 

A s n 

L y s 

C y s 
3 5 5 

Cy s 

P b e 

S e r 

Asp 

A s n 
3 6 0 

P I o 

P T O 

G I u 

C y s 

Ty r 
3 6 5 

L y s 

Asp 

G l y 

A I a 

Asp 
3 7 0 

A r g 

P b e 

Me t 

A s n 

G 1 u 
3 7 5 

A 1 a 

L y s 

G I u 

A r g 

P h e 
3 8 0 

A 1 a 

Ty r 

Leu 

Ly s 

G I n 
3 8 5 

A s n 

Cy s 

A s p 

I I e 

Leu 
3 9 0 

H i 8 

G 1 u 

H i s 

G 1 y 

G 1 u 
3 9 5 

T y r 

Leu 

Pie 

G L u 

A s n 
4 0 0 

G I u 

Leu 

Leu 

I 1 e 

A r g 

4 0 5 

Ty r 

T h r 

Ly s 

L y s 

Me t 
4 i 0 

P I o 

G 1 n 

V a I 

S e i 

Asp 
4 1 5 

G l u 

T h r 

Leu 

1 1 e 

G 1 y 
4 2 0 

I 1 e 

Ala 

H i s 

G 1 n 

Me t 
4 2 5 

A 1 a 

Asp 

I I e 

G 1 y 

G 1 u 
4 3 0 

H L s 

C y s 

Cy s 

Ala 

V a 1 
4 3 5 

P r o 

G l u 

A s n 

G I n 

Aig 
4 4 0 

Met 

P T o 

C y s 

A l a 

G 1 u 
4 4 5 

G 1 y 

Asp 

Leu 

T h r 

I 1 e 
4 5 0 

Leu 

I I e 

G I y 

L y s 

Me t 
4 5 5 

C y s 

G 1 u 

A T g 

G l n 

L y s 
4 6 0 

Ly s 

Thr 

P b e 

I I e 

A s n 
46 5 

A s n 

H i s 

V a 1 

A 1 a 

H i s 

4 7 0 

C y s 

C y s 

T b r 

A s P 

S e r 
4 7 5 

Ty i 

S c T 

G I y 

Me t 

A r g 
4 8 0 

S e r 

C y s 

P h e 

T h r 

A I a 
4 8 5 

Leu 

G 1 y 

P r o 

Asp 

G 1 u 
4 9 0 

Asp 

Ty r 

V a 1 

P r o 

P r o 
4 9 5 

P r o 

V a 1 

T b r 

Asp 

Asp 
5 0 0 

T h r 

P h e 

H i s 

Pie 

Asp 
5 0 5 

Asp 

Ly s 

I 1 e 

C y s 

T b r 
5 1 0 

A I a 

A s n 

Asp 

L y s 

G 1 u 
5 1 5 

L y s 

G 1 n 

H i s 

I 1 e 

L y s 
5 2 0 

G I n 

L y s 

P h e 

Leu 

V a I 
5 2 5 

L y s 

Leu 

I 1 e 

L y s 

V a 1 
5 3 0 

S c r 

P I o 

L y s 

Leu 

G 1 u 
5 3 5 

Ly s 

A s n 

H i s 

I I e 

Asp 
5 4 0 

G 1 u 

Tr P 

Leu 

Leu 

G 1 u 
5 4 5 

P b e 

Leu 

L y s 

Me t 

V a 1 
5 5 0 

G I n 

L y s 

C y s 

Cy s 

T h r 
5 5 5 

A I a 

A s p 

G 1 u 

H i s 

G 1 n 
5 6 0 

P r o 

C y s 

P b c 

Asp 

T h r 
5 6 5 

G l u 

L y s 

P I o 

Va 1 

Leu 
5 7 0 

I 1 c 

G 1 u 

H i s 

C y s 

G 1 n 
5 7 5 

L y s 

Leu 

H i s 

P r o 















( 2 ) INFORMATION FOR SEQ ID NO:9:

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 590 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein 

( i i i ) HYPOTHETICAL: NO 


( i v ) ANTI-SENSE: NO
( v ) FRAGMENT TYPE: N-terminal
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:9:


Ser Gln Ala Gln Asn Gln Ile Cys Thr Ile Phe Thr Glu Ala Lys Glu
1               5                   10                  15

Asp Gly Phe Lys Ser Leu Ile Leu Val Gly Leu Ala Gln Asn Leu Pro
            20                  25                  30

Asp Ser Thr Leu Gly Asp Leu Val Pro Leu Ile Ala Glu Ala Leu Ala
        35                  40                  45

Met Gly Val Lys Cys Cys Ser Asp Thr Pro Pro Glu Asp Cys Glu Arg
    50                  55                  60

Asp Val Ala Asp Leu Phe Gln Ser Ala Val Cys Ser Ser Glu Thr Leu
65                  70                  75                  80

Val Glu Lys Asn Asp Leu Lys Met Cys Cys Glu Lys Thr Ala Ala Glu
                85                  90                  95

Arg Thr His Cys Phe Val Asp His Lys Ala Lys Ile Pro Arg Asp Leu
            100                 105                 110

Ser Leu Lys Ala Glu Leu Pro Ala Ala Asp Gln Cys Glu Asp Phe Lys
        115                 120                 125

Lys Asp His Lys Ala Phe Val Gly Arg Phe Ile Phe Lys Phe Ser Lys
    130                 135                 140

Ser Asa Pro Met Leu Pro Pro His Val Val Leu Ala Ile Ala Lys Gly
145                 150                 155                 160

Tyr Gly Glu Val Leu Thr Thr Cys Cys Gly Glu Ala Glu Ala Gln Thr
                165                 170                 175

Cys Phe Asp Thr Lys Lys Ala Thr Phe GlA His Ala Val Met Lys Arg
            180                 185                 190

Val Ala Glu Leu Arg Ser Leu Cys Ile Val His Lys Lys Tyr Gly Asp
        195                 200                 205

Arg Val Val Lys Ala Lys Lys Leu Val Gln Tyr Ser Gln Lys Met Pro
    210                 215                 220

Gln Ala Ser Phe Gln Glu Met Gly Gly Met Val Asp Lys Ile Val Ala
225                 230                 235                 240

Thr Val Ala Pro Cys Cys Ser Gly Asp Met Val Thr Cys Met Lys Glu
                245                 250                 255

Arg Lys Thr Leu Val Asp Glu Val Cys Ala Asp Glu Ser Val Leu Ser
            260                 265                 270

Arg Ala Ala Gly Leu Ser Ala Cys Cys Lys Glu Asp Ala Val His Arg
        275                 280                 285

Gly Ser Cys Val Glu Ala Met Lys Pro Asp Pro Lys Pro Asp Gly Leu
    290                 295                 300

Ser Glu His Tyr Asp Ile His Ala Asp Ile Ala Ala Val Cys Glu Thr
305                 310                 315                 320

Phe Thr Lys Pro Thr Asp Val Ala Met Gly Lys Leu Val Tyr Glu Ile
                325                 330                 335

Ser Val Arg His Pro Glu Ser Ser Gln Gln Val Ile Leu Arg Phe Ala
            340                 345                 350

Lys Glu Ala Glu Gln Ala Leu Leu Gln Cys Cys Asp Met Glu Asp His
        355                 360                 365

Ala Glu Cys Val Lys Thr Ala Leu Ala Gly Ser Asp Ile Asp Lys Lys
    370                 375                 380

Ile Thr Asp Glu Thr Asp Tyr Tyr Lys Lys Met Cys Ala Ala Glu Ala
385                 390                 395                 400

Ala Val Ser Asp Asp Ser Phe Glu Lys Ser Met Met Val Tyr Tyr Thr
                405                 410                 415

Arg Ile Met Pro Gln Ala Ser Phe Asp Gln Leu His Met Val Ser Gln
            420                 425                 430

Thr Val His Asp Val Leu His Ala Cys Cys Lys Asp Glu GlQ Gly His
        435                 440                 445






Phe Val Leu Pro Cys Ala Glu Glu Lys Leu Thr Asp Ala Ile Asp Ala
    450                 455                 460

Thr Cys Asp Asp Tyr Asp Pro Ser Ser Ile As& Pro His Ile Ala His
465                 470                 475                 480

Cys Cys Asn Gln Ser Tyr Ser Met Arg Arg His Cys Ile Leu Ala Ile
                485                 490                 495

Gln Pro Asp Thr Glu Phe Thr Pro Pro Glu Leu Asp Ala Ser Ser Phe
            500                 505                 510

His Met Gly Pro Glu Leu Cys Thr Lys Asp Ser Lys Asp Leu Leu Leu
        515                 520                 525

Ser Gly Lys Lys Leu Leu Tyr Gly Val Val Arg His Lys Thr Thr Ile
    530                 535                 540

Thr Glu Asp His Leu Lys Thr Ile Ser Thr Lys Tyr His Thr Met Lys
545                 550                 555                 560

Glu Lys Cys Cys Ala Ala Glu Asp Gln Ala Ala Cys Phe Thr Glu Glu
                565                 570                 575

Ala Pro Lys Leu Val Ser Glu Ser Ala Glu Leu Val Lys Val
            580                 585                 590
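The listings above give each residue as a three-letter code, with position numbers marked at every fifth residue. As a minimal sketch of how such a listing could be converted to one-letter codes and sliced by residue position, the following Python example may be useful. The function names `to_one_letter` and `fragment` are hypothetical, the demonstration input is only the first 16 residues of the 590-residue listing above, and the reading of a residue range as 1-based and inclusive at both ends is an assumption rather than something the text states.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string.

    Position numbers (tokens made only of digits) are skipped.
    """
    residues = [tok for tok in listing.split() if not tok.isdigit()]
    return "".join(THREE_TO_ONE[tok] for tok in residues)


def fragment(sequence: str, first: int, last: int) -> str:
    """Return residues first..last, counted from 1 and inclusive at both ends."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("range lies outside the sequence")
    return sequence[first - 1:last]


# Hypothetical demonstration input: residues 1-16 of the listing above.
demo = "Ser Gln Ala Gln Asn Gln Ile Cys Thr Ile Phe Thr Glu Ala Lys Glu"
one_letter = to_one_letter(demo)
print(one_letter)                   # SQAQNQICTIFTEAKE
print(fragment(one_letter, 6, 10))  # QICTI
```

Under this inclusive convention, a range such as residues 190 to 494 would span 494 - 190 + 1 = 305 residues.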





What is claimed is: 

1. A serum albumin protein fragment consisting of at least 
one serum albumin binding region selected from the group 
consisting of binding region subdomain IIA and binding 
region subdomain IIIA. 

2. A serum albumin protein fragment according to claim 
1 wherein the serum albumin binding region consists of 
binding region subdomain IIA. 

3. A serum albumin protein fragment according to claim 
1 wherein the serum albumin binding region consists of 
binding region subdomain IIIA. 

4. A serum albumin protein fragment according to claim 
1 wherein the serum albumin binding region consists of 
binding region subdomains IIA, IIB and IIIA. 

5. A serum albumin protein fragment according to claim 
1 wherein the serum albumin binding region is a binding 
region of a serum albumin selected from the group consisting 
of human, bovine, equine, ovine, rat, frog, sheep, 
salmon, mouse, and sea lamprey serum albumin proteins. 


6. A serum albumin protein fragment according to claim 
5 wherein the serum albumin binding region is a human 
serum albumin binding region. 

7. A serum albumin protein fragment according to claim 
5 wherein the serum albumin binding region is an equine 
serum albumin binding region. 

8. A serum albumin protein fragment according to claim 
5 wherein the serum albumin binding region is a bovine 
serum albumin binding region. 

9. A serum albumin protein fragment according to claim 
8 wherein the serum albumin binding region consists of SEQ 
ID NO:1. 

10. A serum albumin protein fragment according to claim 
8, wherein the serum albumin binding region consists of 
SEQ ID NO:2. 

11. A serum albumin protein fragment according to claim 
4 wherein the serum albumin binding region consists of 
amino acids 190 to 494 of SEQ ID NO:4. 

* * * * * 